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NECESSARY  AND  SUFFICIENT  CONDITIONS 


FOR  OPTIMAL  CONTROL  OF  QUASILINEAR 
PARTIAL  DIFFERENTIAL  SYSTEMS 
by 

N.A.  Derzko,  S.P.  Sethi,  and  G.L.  Thompson 

ABSTRACT 

This  paper  studies  the  problem  of  optimal  control  of  quasilinear 
partial  differential  systems.  A  maximum  principle  necessary  condition 
Is  derived  and  shown  to  be  sufficient  when  the  Hamiltonian  and  boundary 
conditions  satisfy  a  convexity  condition.  The  derivation  is  made  using 
Stokes  theorem  and  a  generalized  version  of  Green's  theorem.  Management 
science  applications  are  suggested  to  problems  of  cattle  ranching, 
perishable  inventories,  manpower  planning,  and  advertising. 
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1 .  Introduction 


In  this  paper  we  deal  with  the  optimal  control  problem 


max  J(u) 


LP  (x»u,t)dw  +  I  G(x,t)  •  dc 


(1.1) 


subject  to 


A{t) x  ■  f(x,u,t)  (1.2) 

where  t  «  fl  £  Rn ,  u(t)  <  K  <=  Rm,  A(t)  is  a  linear  partial 
differential  operator  and  3fi  denotes  the  boundary  of  (2  as 
usual.  We  assume  x  is  scalar  valued  for  simplicity  in 
exposition.  Generalisation  to  vector  valued  x  is  possible. 
Suitable  boundary  conditions  for  (1.2)  will  be  introduced 
later.  We  call  (1.2)  quasilinear  because  it  is  linear  in  the 
derivatives  of  x  (but  x  Itself  can  appear  nonlinearly) . 

We  derive  the  maximum  principle  for  this  problem  using  methods 
which  treat  all  the  components  of  t  on  an  equal  basis, 
thereby  allowing  more  generality  in  certain  directions  than 
the  usual  approach  to  distributed  parameter  systems.  In 
particular  ft  does  not  need  to  be  a  cylinder  set.  Further* 
more  this  approach  uses  Stokes  Theorem  and  a  generalized 
version  of  Green's  theorem.  It  leads  to  a  clean  concise 
methodology  which  can  take  advantage  of  existence  theory  when 
available,  and  yields  useful  formulas  even  when  existence 
theory  is  not  available.  Finally,  by  making  a  convexity 


assumption  on  tha  Hamiltonian  and  boundary  conditions  as  in 
(2/  7,  8]  w«  obtain  a  sufficiency  rssult  as  wall. 

There  are  numerous  classical  optimal  control  problems 
currently  handled  as  distributed  parameter  systems  —  optimal 
heating  of  solids,  efficient  extraction  of  crude  oil  from  oil 
fields,  optimal  drying  processes,  and  optimally  controlled 
large  scale  chemical  reactions,  to  name  a  few.  Governing 
differential  equations  and  typical  objective  functions  for 
such  problems  are  described  in  [4J .  The  results  of  this 
paper  will  allow  solution  for  more  general  geometries  and 
time  profiles  than  hitherto  possible. 

In  addition  this  theory  is  applicable  to  several  new 
areas  in  management  science,  for  example:  cattle  ranching 
[5];  perishable  inventories  [3];  manpower  planning 
[Gaimon -Thompson,  unpublished] ;  and  advertising  problems 
[Seidman-Sethi-Derzko,  unpublished] .  Many  other  applications 
are  possible  given  the  generality  of  the  partial  differential 
operator  and  objective  functional  used  in  this  work. 


Preliminaries 


To  make  this  paper  more  generally  accessible  we  review 
some  concepts  and  results  of  general  geometric  integration 
theory  [9] . 

In  this  paper  we  shall  deal  with  a  standard,  domain  8  <=Bn 
i.e.  one  to  which  the  general  Stokes  theorem  applies;  see  [9]. 

The  definition  of  standard  domain  includes  that  8  must  be 
connected,  we  impose  the  additional  restriction  that  the 
boundary  38  is  made  up  of  a  finite  number  of  pieces,  each 
of  which  is  a  differentiable  image  of  a  connected  open  set 
in  Hn-1.  This  requirement  provides  us  with  a  parametrization 
of  the  hypersurface  38,  useful  for  evaluating  integrals  on 
38.  Of  course  further  generalization  of  (2  is  possible  hut 
will  not  be  considered  in  this  paper. 

We  denote  a  point  in  8  by  t  -  (tx  ,  tn) .  The 

volume  integral  over  8  of  a  scalar  valued  function  <(>  (t) 

will  be  written  |  $(t)dw,  where  dw  »  dt.dt„  . . .  dt  .  The 

Jfl  l  2  n 

surface  integrals  we  consider  are  generalizations  of  the 
common  two  dimensional  formula  j  j  •  vdo  where  v  is  the 
usual  outward  normal.  In  the  n-1  dimensional  case  hyper** 
surface  integrals  (called  simply  surface  integrals  henceforth) 
are  written 

i  V  •  do  (2.1) 

•>38 


Here  V  •  do  is  an  n-1  differential  form  on  38.  More 
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specifically, 

¥  •  do  ■  E“-l  ¥s(t)do3  (2.2) 

where  each  H^Ct)  is  a  scalar  valued  function  on  3(2, 
do  «  (do1  ,  d o2  ,  ,  don)  and  do  ^  is  the  symbol 

dOj  -  (-l)^  dtx  dt2  ...  dt^_1  dtj+1  ...  dta  (2.3) 

Evaluation  of  (2.11  requires  us  to  choose  a  parametrization  of 

3(2  (or  a  piece  of  3(2)  say  t(s)  where  s  c  S  =  Rn~1 . 

Then  the  term  ¥.  do .  is  evaluated  by  the  formula 
'3(2  3  3 

|  do  j  *  (-1)3  |  ¥  j  (t  (s) )  v  j  (s)  dst  . . .  dsn_x 

where 


is  a  Jacobian.  The  order  of  appearance  of  the  functions 
tj(s)  in  (2.4)  is  determined  by  the  orientation  of  3(2  as 
a  manifold.  Again  we  refer  the  reader  to  [W]  for  details. 

Stokes  Formula  is 

f  d*  -  [  ♦  (2.5) 

'(2  '3(2 

where  ♦  •  ¥  •  do  is  a  differential  form  on  3(2.  Corresponding 
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wh«r« 


\k2  ...kr  *  3k,  3k2  ”•  3kt  • 


(2.9) 


Note  that  the  first  term  corresponding  to  i  *  1  in  the  integrand 
on  the  right  side  is 


Proof  outline 


We  have  from  (2.6) 


d[xy  dOj]  -  SjfxyJdc^  -  [(a^y+xtyijdw 


(2.10) 


We  begin  with  the  identity 


^3k1...krx)y  +  <_1,r  x^3k _ k_yJ 


(3k,  3k....k  x)y  *  (3k,...kx)lV) 

1  2  r  ^  c  x 


-^kj-.-k/Ks^y)  -  (3k3...krx)  ^kjkj*') 

+  (3k3...krx>  (3ktk2yl  *  t\...krx)  ^kjkjk/l 
+(-i)r'l[(3krx)[Jki...kt_iy)  *  x(3kl...k/l] 


(2.11) 


If  we  multiply  (2.1)  by  dw  and  apply  (2.10)  to  each  pair  of 
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L  i 
K  X 

K  - 


terms  on  the  right  we  obtain 


[(\...kr*)r  +  <-Dr'^(3ki...k]_y)]d»  . 


3-1 


k  k  yH3k  k  x^dffk 
Kj+1“  r  Kjj 


(2.12) 


which  when  substituted  in  (2.S)  or  (2.7)  yields  the  theorem. 

Next  we  introduce  linear  partial  differential  operators, 
their  ad joints  and  boundary  forms.  For  this  it  is  convenient 
to  mention  the  standard  notation  for  a  high  order  partial 
derivative.  If  J  *  (j1...jn)  is  an  n-vector  of  nonnegative 
integers  we  let 


a*  « 


jl  j2 
31  32 


.  a 


n 


(2.13) 


It  is  understood  that  a£  means  no  derivative.  The  order  |J| 
of  the  derivative  3J  is  defined  as  +  j2  +  •••  +  jn*  A 
linear  partial  differential  operator  A  is  given  by 


Ax{ t)  »  Ej  Sj(t)  3ux(t) 


(2.14) 


where  J  ranges  over  a  finite  set  S.  The  order  of  A  is 
maxJ<s  |J|  . 

Hilbert  spaces  constitute  a  useful  tool  for  dealing 

2 

with  linear  partial  differential  operators.  We  let  L  (ft) 
denote  the  usual  Hilbert  space  of  square  integrable  functions 


'  S' 
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on  ft.  If  f(t)  and  g(t)  are  real,  their  L  inner  product 
is  denoted 


(f,g)  ■  f  (t)  g(t)du»  .  (2.15) 

jft 

For  our  subsequent  work  we  derive  the  final  specialization  of 
Stokes  Theorem.  Suppose  in  (2.8)  we  let 

y(t)  »  aj(t)X(t)  (2.16) 


and  assume  that  aj(t)  is  sufficiently  smooth  to  present  no 
problems  in  taking  the  required  partial  derivatives. 
Substituting  (2.16)  into  (2.8)  we  obtain 


|  [xajOJx)  +  (-1)IJI'"1  3J(ajA)x]duj 

f  k  a  X)  (3 

J3ft  L  31  J 


'Tc  k  ^0 


La 


Vj(ajX,  x)  •  do 


(2.17) 


where  the  last  expression  defines  . 

Summation  over  J  yields  the  form  of  Stokes  Theorem 
to  be  used  in  the  next  section. 


[A  (Ax)  -  (A ’  A)  x]  dw 

T  (A,x)  *da 
'3ft  A 


(2.18) 
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where  the  formal  adjoint  operator  A'  is  defined  by 

A'X  »  Sj  (-1)  I 3J(aJA)  (2.19) 

and  the  boundary  form  rA  is  given  by 

rA(X,x)  -  Ej  TjtajA/X)  .  (2.20) 

We  note  that  fA  is  bilinear  in  (X,x)  and  each  component  of 

r,  is  of  the  form 
A 

X,  X2  X3 

ZT  T  _  3  1aT  3  ZX  3  x 

Xl'X2'X3  J 

where  |I2|  +  | |  <  order  (A). 

Finally,  we  shall  need  the  following  lemma  in  proving 
our  sufficiency  result  in  section  5 .  Special  cases  of  the 
result  are  well  known  as  envelope  theorems. 

V 

Lemma.  Let  h(x,y)  ,  with  x  «  H,  y  «  S  =  K  ,  and  S  compact 
be  continuous  in  (x,y)  and  continuously  differentiable  in  x 
for  each  fixed  y.  Define 

h°  (x)  ■  max{h(x,y)  :  y  <  S}  »  h(x,  y°  (x) ) ,  (2.21) 

o 

and  assume  that  h  (x)  is  also  continuously  differentiable. 


Then 
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h° ' (x)  -  ||(x,  y°(x))  ,  x  c  S  .  (2.22) 


Proof: 

From  the  definition  of  derivative  we  have 
h0'  (x)  -  J|(x,  y°  (x) )  • 

-  11,  h-[?'  -feis'.f.  .  12.23 

x'-x  x  ”x 

If  we  let  L  denote  the  limit  and  DQ'  denote  the  difference 
quotient  in  (2.23) ,  our  uniformity  assumptions  enable  us  to 
conclude  that  given  e  >  0,  there  exists  S  >  0,  such  that 
| x 1  “ x |  <  5  implies  |DQ'  -L|  <  e.  Now  (2.21)  implies 

h(x'  ,  y°(x’))  -  h (x '  ,y°(x))  *  0 

for  all  x1  /  x.  Consequently  if  we  choose  x"  such  that 
x"  -x  ■  -(x'  - x)  and  let  DQ"  denote  the  corresponding 
difference  quotient,  then  | DQ"  - L j  <  e  and  DQ'  ,  DQ"  have 
opposite  sign.  It  follows  that 

|  DQ"  -DQ'  I  S  I  DQ"  -L|  +  |  L  -  DQ '  |  <  2e 

implying  both 


DQ"  <  2e  and  DQ'  <  2e  . 


We  conclude 


L  <  |L  -DQ*  |  +  |DQ'  |  -  3e 


for  every  e  >  0  and  therefore  L  ■  0,  completing  the  proof 


3.  The  Optimal  Control  Problem 

In  this  section  we  give  a  precise  statement  of  the 
optimal  control  problem  (1.1)  and  (1.2)  using  the  notation 
of  section  2. 

In  (1.2)  let  A  be  a  linear  partial  differential 
operator  defined  in  the  domain  R  e  »n  as  specified  in 
section  2.  That  is,  if  x  «  C*(R)  then 

Ax (t)  ■  IT  a_(t)3Jx(t). 


(3.1) 


for  all  t  <  £2  where  we  assume  aj  «  (£2)  ,  the  space  of 

functions  on  A  continuously  differentiable  up  to  order  |J|. 
The  set  of  admissible  controls  u(t)  is  specified  by 
requiring  that  u(*)  be  measurable  and  map  £2  **■  K  c  we 

assume  that  the  function  f(x,u,t)  is  continuously 
differentiable,  on  E  «  K  k  II.  To  complete  the  specification 
of  (1.2)  we  introduce  the  boundary  conditions 

B^Jctt)  ■  g^(t)  1  s  k  s»  r  t  e  3£2  (3.2) 

where  each  is  a  linear  partial  differential  operator  of 

order  less  theui  A  and  defined  for  t  <  3£2  where  3(2  denotes 
the  boundary  of  (2  as  is  customary.  It  is  often  convenient 
to  use  the  vector  notation  Bx(t)  *  g(t)  instead  of  (3.2)  . 
Finally,  we  assume  that  the  boundary  value  problem  defined  by 
(1.2)  and  (3.2)  has  a  unique  solution  for  each  admissible 
control  u(*).  In  the  objective  function  (1*1)  we  assume 
that  the  scalar  function  F  and  the  Hn  valued  function  G 
are  continuously  differentiable  on  their  domains,  and  x  is 
the  solution  of  (1.1)  and  (3.2)  for  each  admissible  u. 

We  have  now  completed  the  specification  of  the  optimal 
control  problem.  To  summarise,  we  restate  the  problem: 

max  J(u)  ■  F(x,u,t)du>  +  [  G(x,t)  •  da  (3.3) 

u  J£2  j3£2 


subject  to 


14 


A(t)x  ■  f(x,u,t)  (3.4) 

with  boundary  condition 

B(t)x  *  g(t)  on  (3.5) 

with  admissible  controls 

us  ft  -*■  K  e  Rn  .  (3.6) 

Necessary  conditions  for  (3.3)  -  (3.6)  are  derived  in  the  next 
section . 

4.  Derivation  of  Necessary  Conditions 

In  this  section  A'  denotes  the  adjoint  operator  to 
A  and  I*A  ( • ,  • )  is  the  corresponding  boundary  form  as  defined 
in  section  2.  The  Hamiltonian  is  defined  by  the  function 

H(x,U/X,t)  *  F(x,u,t)  +  Xf(x,u,t)  (4.1) 

Analogously  to  the  adjoint  aquations  in  optimal  control  theory 
we  shall  find  that  the  adjoint  function  X(t),  t  c  ft  satisfies 

A' X (t)  -  3  H(x(t),  u(t)  ,  X(t)  ,  t)  (4.2) 


where  u(t)  and  x(t) ,  t  «  ft  are  the  optimal  control  and 


IS 


corresponding  state  functions.  In  addition,  X  satisfies  the 
adjoint  boundary  condition  given  by 

axG(5,t>5  -  rA(x,e),  t  «  a  n  (4.3) 

for  all  functions  5(t)  defined  in  ..a  neighbourhood  of  3ft  and 
satisfying 

# 

B?(t)  *  o,  t  s  an  (4.3») 


where  B  is  defined  in  (3.5).  Note  that  both  3  G  and  T 

X  A 

are  n  vectors . 

The  actual  derivation  proceeds  as  follows.  We  let  J 

cL 

denote  the  augmented  objective  function 


J_  (u) 

A 


J(u)  +  [  X(t)[f[x(t),  u(t)  ,  t)  -Ax(t)]du 

Jn 

*  I  Hdw+|  G  •  do  -  ( X ,  Ax) 

Jn  Jan 


(4.4) 


where  we  note  that  the  derivation  of  the  second  expression 
uses  (3.3)  and  (4.1).  If  x  satisfies  (3.4)  and  (3.5)  then, 
of  course,  J  ■  J.  We  assume  also  that  X  satisfies  (4.2) 

fli 

and  (4.3).  Then  using  (2.15)  and  the  generalized  Green's 
theorem  (2.18),  we  rewrite  (4.4)  as 


H  du  + 


n 


f  (G-r.)  -do  -  (A'X,  X) 

Jan  A 


[h  -  (A'X)x]du»  ♦  f  (G-r.)dc  . 
Jn  ^n 


(4.5) 


Finally  consider  the  first  order  variation  fij  of  J 

A  A 

produced  by  a  change  fiu  in  u,  uniformly  small  for  each  t. 
For  each  t ,  we  shall  denote  the  first  order  change  in  H 
due  to  a  change  <Sx  in  x  by  3X  H  <Sx  (u  held  constant) , 
and  the  change  in  H  due  to  the  change  in  u  by  AH  (x  held 
constant)  .  Likewise  3X  S  fix--  denotes  the  first  order  part 
of  the  change  in  S  due  to  a  change  in  x.  Then 

«.  -  f  [<3.H*  A'X)  fix  +  AH]du»  . 

Jfl  * 

+  f  [3  G(x(t)  ,  t)  fix(t)  -  F.  (X  (t)  ,  fix(t)  )]da 
j3fl  x  A 

*  |  AH  da»  (4.6) 

'ft 

after  using  (3.4),  (3.5),  (4.2)  and  (4.3) .  Suppose  now  that  u 
is  an  optimal  controller  and  u  differs  from  u  only  within  a 
small  neighbourhood  of  the  given  point  t.  Then  fiJ  s  0  if 
u  corresponds  to  a  maximum  of  J  and  standard  techniques 
[4]  can  be  used  to  show 


AH(x(t),  u ( t) ,  X(t),  t)  s  0.  (4.7) 

This  implies  that  if  u(t)  is  an  optimal  control  function  and 
x(t),  X  (t)  aure  corresponding  state  and  adjoint  functions 
then  (4.7)  implies 


H(x,u,  X, t)  ■  maxn  H (x , u, X , t) 


(4.8) 
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which  is  the  maximum  principle.  To  summarise/  the  necessary 
optimality  conditions  are  (3.4),  (3.5),  (3.6),  (4.1),  (4.2), 
(4.3) ,  (4.3*) ,  and  (4.8)  . 

5.  Sufficiency  Conditions 

In  this  section  we  derive  sufficiency  conditions  which 
generalize  sufficiency  conditions  in  ordinary  control  problems 
[2,  7,  8,  etc.].  The  assumptions  required  to  prove  these 
results  quite  often  hold  in  economics  problems. 

The  key  assumption  is  the  concavity  of  the  derived 
Hamiltonian  H° ,  where 

H°(x,X,t)  »  maxu  H(x,u,X,t) .  (5.1) 

Specifically  we  assume  that  H°  is  concave  as  a  function  of 
x,  for  any  given  (X,t) ,  that  is 

H°(x,X,t)  -  H°(x*,X,t)  S  3xH°(x*,X,t) (x-x*) .  (5.2) 

In  fact,  as  we  shall  see  shortly,  we  only  need  this  inequality 
when  x*  is  the  optimum  trajectory,  but  since  such  a 
trajectory  is  not  normally  known  at  the  outset,  a  concavity 
hypothesis  is  more  useful.  We  assume  also  that  the  function 
G  is  oonoavt-lik* ,  that  is 

G(x*,t)  -  G(x,t)  i  3xG(x-x*,  t)(x*-x) 


(5.3) 
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for  each  t,  x,  x*.  (5.3)  holds  trivially  if  G(x,t)  is  linaar 

in  x.  Combining  (5.1)  and  (5.2)  wa  obtain 


H(x,u,X,t)  s  H°(x,X,t)  <  8°(x*,Xft)  +  3xH°(x*,X,t) (x-x*).  (5.5) 


Undar  tha  assumption  that  both  H  and  H  ara  continuously 
diffarantiabla  functions  of  x  wa  hava  from  tha  lemma  in  Saction  2 


3  H  (x,X,t)  »  3  H(x,u(x)  ,X,tj 


(5.6) 


where  u(x)  is  tha  maximising  point  in  (5.1).  We  now  substitute 
tha  expression  for  the  Hamiltonian  (4.1)  in  (5.51  and  note  (5.6) 
and  (4.2)  to  write 


F(x*,u*,t)  -  F(x,u,t) 


2  X*ff(x,u,t)  -f(x*,u*,t)] 


-  3xH(x*,u*,X*,t) (x-x*) 


X*A(x(t)  -x*(t))  -  A’X*(t)  (x(t)  -x*(t))  (5.7) 


1  n 

L 


where  x*,  u* ,  X*  denote  values  at  optimum.  Integrating  over 

ft  and  adding  tha  terms  I  G(x*,t)  •  do  -  G(x,t)  •  do  to 

'3fl  J3fl 

both  sides  wa  obtain 


t 


r 
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J(u*)  -  J(u) 

*  (A*,  A(x-x*))  -  (A' A*,  x-x*) 

+  [  G(x*,t)  •  do  -  I  G(x,t)  •  do 

J3fl  J3G 

a  f  Ti  (A#,  x-**)  •  da  , 

J3Q  A 

-  f  3  G(x-x*,  t) (x-x*) 

ha  x 

■  o 

where  we  have  used  (2.8),  (S.3)  and  (4.3).  Thus  J(u*)  a  J(u) 
which  complatas  the  argument. 

6 .  Extensions 

I .  This  work  can  be  extended  to  the  state  x  .  to  be  a 
vector.  Then  A  becomes  a  matrix  whose  entries  A^  are 
linear  partial  differential  operators.  The  formal  adjoint  A' 
is  then  the  transposed  matrix  of  formal  adjoints  {a^}. 

II.  In  many  practical  problems  the  adjoint  boundary 
condition  (4.3),  (4.3')  can  be  reduced  to  a  condition  of  the 
form  B~A(t)  *  0  where  B''  is  a  vector  of  partial  differential 
operators.  Such  a  simplification  is  very  useful  in  simplifying 
the  handling  of  the  adjoint  equations. 

ZZI.  Extension  of  this  paper  to  include  boundary  controls 


is  currently  under  investigation 
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